The immunodeficiency, centromere instability and facial anomalies (ICF) syndrome is associated with mutations of the
DNA methyl-transferase DNMT3B, resulting in a reduction of enzyme activity. Aberrant expression of immune system
genes and hypomethylation of pericentromeric regions accompanied by chromosomal instability were determined as
alterations driving the disease phenotype. However, so far only technologies capable of analyzing single loci have been applied
to determine epigenetic alterations in ICF patients. In the current study, we performed whole-genome bisulphite
sequencing to assess alterations in DNA methylation at base-pair resolution. Genome-wide, we detected a 42% decrease in
methylation level, with the most profound changes occurring in inactive heterochromatic regions, satellite repeats
and transposons. Interestingly, transcriptionally active loci and rRNA repeats escaped global hypomethylation. Despite
a genome-wide loss of DNA methylation, the epigenetic landscape and crucial regulatory structures were conserved.
Remarkably, we revealed a mislocated activity of mutant DNMT3B at H3K4me1 loci, resulting in hypermethylation of active
promoters. Functionally, we could associate alterations in promoter methylation with the immunodeficient phenotype of
the ICF syndrome by detecting changes in genes related to the B-cell receptor mediated maturation pathway.



Introduction 

Alterations of the epigenetic landscape and of single gene loci are
associated with different types of diseases. Although cancer is
the most frequently studied phenotype, other diseases related
to cardiovascular, neurological and autoimmune defects also reveal
variations in their epigenetic profiles.^ The majority of
observed alterations affect DNA methylation, a modification
of cytosines introduced by DNA methyl-transferases
(DNMTs). DNA hypermethylation, in particular in gene promoter
regions, results in impaired transcription initiation and
subsequent gene silencing.^ While genome-wide the majority of
cytosines in a CpG context are methylated, focal regions of low
methylation level, often associated with CpG rich regions (CpG
islands), secure the accessibility of the transcription machinery
and hence facilitate gene activation.^ Throughout cell division,
DNA methylation is maintained by DNMT1, whereas
methyl groups are introduced de novo by DNMT3A and
DNMT3B.

Recombinant mouse models deficient for DNMT3B expression
revealed profound developmental defects resulting in lethality
during embryonic development or shortly after birth.^ Interestingly,
in humans homozygous mutations of DNMT3B, suggested to
result in reduced activity rather than a complete functional
impairment, lead to the development of the immunodeficiency,
centromere instability and facial anomalies (ICF) syndrome.^ *^
The disease phenotype is characterized by immunodeficiency
due to decreased numbers of mature B cells, causing
agammaglobulinemia. Furthermore, chromosomal instability, mainly
observed in juxtacentromeric regions of chromosomes 1 and 16 and,
to a lesser extent, of chromosome 9, and facial anomalies are
frequent disease features. Although the majority of genetic defects
in ICF patients are related to DNMT3B (Type 1), genetic alterations
in ZBTB24 were also associated with the disease phenotype
(Type 2) 7'' 

From an epigenetic point of view, DNA hypomethylation was
observed at satellite repeats (SAT2/3) of chromosomes 1, 9, 16
and 21.^'^^ Of note, hypomethylation of SAT2 repeats on
chromosomes 1 and 16 and missegregation of both chromosomes
were also observed after treatment with the DNMT inhibitor
5-Azacytidine.^^ RNAi-mediated knockdown of DNMT3B in
embryonic stem cells resulted in methylation changes similar
to those observed in the ICF patient. Here, whole-genome bisulphite
sequencing (WGBS) revealed large hypomethylated regions in
Figure 1. Whole-genome bisulfite sequencing of a healthy (CTRL) and an ICF patient (ICF) sample. (A) Circos representation of genome-wide DNA
methylation levels in the CTRL and ICF samples. Average levels for all the CGs in 297 10-Mbp-wide windows. Inner track indicates the magnitude of
the difference between the CTRL and the ICF individual for each window (color scale and red line). Average methylation levels in all the regions are
expressed as β values (0-1) and color-coded (blue). (B) DNA methylation level of centromeric regions of all 22 autosomes and the X chromosome
displayed as ratio of methylated reads to unmethylated reads. (C) Genome-wide DNA methylation level of 22 autosomes and the X chromosome
displayed as ratio of methylated reads to unmethylated reads. (D) DNA methylation profile of chromosome 1 for the CTRL (red) and the ICF sample (blue).
(E) DNA methylation profile of chromosome X for the CTRL (red) and the ICF sample (blue). (F) DNA methylation level of genomic features displayed as
ratio of methylated reads to unmethylated reads. (G) DNA methylation level of regulatory elements displayed as ratio of methylated reads to
unmethylated reads. (H) DNA methylation profile of the promoter region (TSS ±2 kb) of protein-coding genes for CpG rich (top) and poor (bottom) promoters
of the CTRL (red) and ICF patient (blue) sample.
particular on the X chromosome and validated the DNA methylation
loss in pericentric regions. In addition to changes in DNA
methylation, ICF patients reveal alterations in the expression of
protein-coding genes^^ and microRNAs.^^

Using WGBS of lymphoblastoid cell lines (LCLs) obtained
from an ICF patient harboring mutated DNMT3B and from one
healthy control, we unraveled changes in DNA methylation at
base-pair resolution and determined previously unknown alterations
associated with the disease phenotype.

Results and Discussion 

Previous studies identified epigenetic changes in ICF patients
harboring DNMT3B mutations in terms of locus-specific DNA
methylation and histone modifications.^^'^^ Here we focus on
DNA methylation, the direct effect of altered DNMT3B function.
To gain unbiased insight into changes in DNA methylation
occurring genome-wide, we performed whole-genome bisulphite
sequencing (WGBS) of lymphoblastoid cell lines (LCLs) obtained
from a female ICF patient (Caucasian, 1 y old) harboring mutated
DNMT3B (A603T/STP807ins;^6'^7 ICF) and one healthy
gender-matched control sample (CTRL; Caucasian, 4 y old). All data
are freely available at the Gene Expression Omnibus (GSE37578,
www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE37578).
Overall, we generated 667 million reads for the CTRL and ICF
samples, of which 88.8% and 85.9% mapped uniquely to the
reference genome (hg19), respectively. Genome-wide we obtained
DNA methylation information for 93% and 90% of Cs and 94%
and 93% of CGs with an average coverage of 9.9- and 9.5-fold for
the CTRL and ICF sample, respectively. Comparing global CpG
methylation levels of both samples revealed profoundly decreased
levels in the ICF patient. Here, the average methylation level of
CpG sites dropped from 59.7% to 34.4% and in total we
identified 8,955,695 (32%) significantly differentially methylated
CpG sites (χ² test, p < 0.05). The full set of WGBS data from
CTRL and ICF is illustrated in Fig. 1A, using Circos.^^ In line
with previously published reports using single-locus approaches,
we detected a decrease in methylation in centromeric satellite
repeats of chromosomes 1, 9 and 16. However, screening
for differentially methylated sites in an ICF patient at base-pair
resolution for the first time revealed hypomethylated centromeric
satellite regions on all other chromosomes as well, even to a larger
extent (Fig. 1B). In addition to the centromeres, a global loss of
methylation of 41% (27-52%) was detected for all autosomes
(Fig. 1C). Most importantly, as previously suggested,^^ chromosome
X showed more profound losses (63%), indicating a general
failure in heterochromatic regions. Displaying the average DNA
methylation level along the chromosomes revealed additional
hypomethylated regions outside of centromeric repeat-rich
regions, spanning the entire chromosome, as displayed for
chromosomes 1 and X (Fig. 1D and E). Overall, a loss of methylation was
apparent across all genomic features, with losses detected in
promoters, exons, introns and intragenic regions (Fig. 1F). However, the
loss of DNA methylation was more intense in non-genic regions.
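
As a quick arithmetic check, the genome-wide averages reported above can be related to the overall loss quoted in the abstract. The sketch below assumes that the loss is expressed relative to the control level; `relative_loss` is a hypothetical helper and not part of the original analysis.

```python
def relative_loss(ctrl_level, icf_level):
    """Fraction of the control methylation level that is lost in the patient."""
    if ctrl_level <= 0:
        raise ValueError("control level must be positive")
    return (ctrl_level - icf_level) / ctrl_level


# Average CpG methylation levels reported in the text (percent).
print(f"{relative_loss(59.7, 34.4):.1%}")  # prints 42.4%
```

Under this assumption, the drop from 59.7% to 34.4% corresponds to a relative loss of roughly 42%.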

From a regulatory standpoint, DNA methylation decreased in
CpG rich and poor promoters, however to a higher extent in the
latter (Fig. 1G). The same effect was observed for CpG islands,
showing conserved hypomethylation at the CpG rich islands and
an increased loss of DNA methylation at the island shores in the
ICF patient. Interestingly, the shape of the DNA methylation profile was
conserved around the transcription start site (TSS) in CpG rich
promoters, with changes preferentially occurring at the edges of
the hypomethylated regions (Fig. 1H). While genome-wide a
drop of methylation from 60% to 34% was observed, proximate
flanking regions of hypomethylated loci maintained a methylation
level of around 50%, suggesting that it is important to preserve the
surrounding sites and the shape of the hypomethylated region. In
cancer cells, CpG island shores were identified as crucial elements
for gene regulation, with highly variable methylation levels between
normal and diseased tissue.^^'^^ In ICF patients, aberrant
methylation in CpG island shores might be responsible for previously
reported changes in expression. In addition to the conserved
DNA methylation structure at promoters, genes containing CpG rich
promoters were described as strongly expressed with a
low evolutionary rate,^^ supporting a positive selection on the
epigenetic and genetic level. For CpG poor promoters, we observed
a profound loss of DNA methylation at the TSS and the
surrounding regions, however with a conserved shape of the overall
structure (Fig. 1G and H). Comparing protein-coding and
non-coding genes revealed similar tendencies, with the non-coding
RNA gene promoters showing generally higher methylation levels
(Fig. S1).

Regional enrichment of differential methylation. Displaying
the methylation profile of the healthy control and ICF patient
along the chromosomes revealed a more profound drop of
DNA methylation in the Giemsa-positive chromosomal regions,
which represent loci of dense chromatin compaction (Fig. 1D
and E). Accordingly, the average methylation level in the ICF
patient decreased significantly with increasing Giemsa staining intensity
(Fig. 2A; Student's t test; p < 0.01). As DNMT3B functions are
closely related to chromatin structure, especially heterochromatin
formation, we analyzed the DNA methylation profile of regions
presenting distinct histone marks in more detail. Here, we took
advantage of chromatin immunoprecipitation sequencing (ChIP-seq)
experiments performed within the ENCODE project on a
lymphoblastoid cell line of a healthy donor (GM12878), enabling
us to determine positions occupied by distinct histone marks.
Analyzing nine different histone marks and regions occupied by
CTCF and H2A.Z for their average DNA methylation level at
the corresponding positions revealed a significant loss of
methylation in loci carrying the repressive Polycomb mark H3K27me3, the
repressive heterochromatin mark H3K9me3^^'^^ and H4K20me1,
the latter previously associated with repressed^^ and active^^'^^ genes
(Fig. 2B; χ² test; p < 0.01). All marks were previously reported to
be enriched in the inactive X chromosome. Furthermore, genes
harboring the repressive histone mark H3K27me3 were previously
associated with DNMT3B activity.^^ The CTCF and
H2A.Z bound regions also revealed a significant loss of DNA
methylation (χ² test; p < 0.05). While for CTCF a clear suppressive
function was described, H2A.Z binding is rather associated with active
gene promoters. However, in mouse models an association with
pericentric heterochromatin has been observed.^^ Most importantly,
Figure 2. Distinct regions of aberrant methylation in the healthy (CTRL) and the ICF patient (ICF) sample. (A) DNA methylation level of genomic
regions defined by Giemsa-staining. Displayed are all regions of one class, gneg: Giemsa-negative; gpos (25-100): Giemsa-positive (light-strong) as
boxplot for the CTRL (red) and the ICF (blue) sample, with ** indicating significance (p < 0.01). (B) Average methylation level of genomic loci occupied
by different histone marks, H2A.Z and CTCF. Displayed are the average DNA methylation levels in the analyzed regions, with significant differences
indicated (**p < 0.01, *p < 0.05). (C) Relative reduction of average DNA methylation in different repetitive elements of the ICF sample compared with
the CTRL. (D) Correlation of the DNA methylation status of neighboring CpGs in the CTRL (red) and the ICF (blue) sample. (E) Characterization of HMRs
present in the CTRL sample in terms of DNA methylation level (Uscore) and CpG content. (F) Characterization of HMRs present in the ICF sample in
terms of DNA methylation level (Uscore) and CpG content.



when we analyzed the autosomes separately, all aforementioned
marks remained significantly altered, suggesting that the regions are
affected genome-wide and that the effect is not driven by the highly
hypomethylated X chromosome. In contrast, histone marks clearly associated
with transcriptional activity lost DNA methylation to a much lesser
extent.^^ In particular, the DNA methylation level of H3K79me2
and H3K36me3 enriched regions remained almost unchanged,
showing only 6% reductions and running strongly against the
genome-wide trend.

Furthermore, we aimed to analyze alterations occurring in
repetitive elements in the ICF patient in more detail. Therefore,
we extracted all sequencing reads mapping to multiple positions in the
reference genome and analyzed them separately to extract DNA
methylation changes in transposons and repeats. Strikingly,
different repeat families appeared to be altered to varying extents,
with satellite repeats revealing the highest reduction (-76%;
Fig. 2C). Furthermore, transposons (RC, -66%; LINE, -58%;
LTR, -57%) showed a very high reduction and RNA repeats
(scRNA, -22%; rRNA, -27%; tRNA, -34%) the lowest reduction
of DNA methylation, suggesting that DNMT3B activity is crucial for
centromere stability as well as transposon repression (Fig. 2C).



When we analyzed the methylation status of neighboring CpG
sites in particular, we unexpectedly observed, given
the massive changes observed genome-wide, that the DNMT3B
mutant patient did not show significantly less correlation between
neighboring sites, suggesting that the global hypomethylation is
due to a reduced activation rather than to a loss introduced by
errors or failure of DNMT3B at random sites (Fig. 2D). Therefore,
we suggest that the mutations analyzed here cause a reduced
activation of DNMT3B while maintaining the enzyme's
processivity. This might be due to an impaired interaction with
DNMT3L, as previously shown for DNMT3B mutations in
ICF patients. Indeed, the A603T variant, known to be SAM
binding deficient, also loses the ability to form homo- and
hetero-oligomers with DNMT3B and DNMT3L proteins in vitro
and in vivo.^^'^' As interaction of DNMT3B with DNMT3L
highly increases the activity of the methyl-transferase,^^ we
suggest that an impaired binding of DNMT3B to its activator
is responsible for the global hypomethylation accompanied by
conserved DNA methylation structures. Detecting a homogeneously
methylated genome in both samples suggests that the
overall epigenomic landscape is maintained, consistent with the
Figure 3. Differentially methylated regions (DMRs) in the healthy (CTRL) and the ICF patient (ICF) sample. (A) Distribution of DMRs observed between
the samples in terms of DNA methylation level and genomic location. (B) DNA methylation profiles of the promoter regions of the germline-specific
genes TDRD1 (left) and TDRD9 (right) in the CTRL (green) and the ICF (blue) sample. DMRs (red bars) and CpG islands (black bars) are indicated. (C) Enrichment
ratios (positive value: overrepresentation; negative values: underrepresentation) of hyper-DMRs in different histone marks, H2A.Z and CTCF.
(D) Relative position counts of hyper-DMRs in H3K4me1 marked promoters. Displayed are relative counts for H3K4me1 occupancy and hyper-DMRs
in a 10 bp window. (E) DNA methylation profile of promoter regions (TSS ±2 kb) overlapping with hyper-DMRs of the CTRL (red) and the ICF (blue)
sample. (F) DNA methylation profiles of the promoter regions of the B-cell maturation associated genes VAV1 (left), NCK1 (middle) and GRB2 (right) in
the CTRL (green) and the ICF (blue) sample. DMRs (red bars) and CpG islands (black bars) are indicated. (G) DNA methylation profiles of the promoter
regions of the BTK activator SYK (left) and the BTK repressor SH3BP5 (right) in the CTRL (green) and the ICF (blue) sample. DMRs (red bars) and CpG
islands (black bars) are indicated.



aforementioned maintenance of CpG rich promoter structures
(Fig. 1H).

Hypomethylated regions (HMRs). As the ICF patient
displayed a global loss of DNA methylation of high magnitude,
we wondered whether this affects the genome-wide structure of
DNA methylation fingerprints such as hypomethylated regions
(HMRs).^^'^^ HMRs are of special interest as they represent loci
of regulatory potential and were previously shown to alter their
methylation level and shape following differentiation from
hematopoietic stem/progenitor cells into myeloid or lymphoid
lineages. As an impaired maturation of B-cells is observed in ICF
patients, deregulation of HMRs might contribute to the disease.
Therefore, we assessed and characterized HMRs throughout the
genome and compared them between both samples. We observed
that the number of HMRs more than tripled in the ICF patient.
While healthy cells harbored around 48,000 distinct regions of
hypomethylation, the ICF patient revealed 164,000 HMRs. The
changes are even more striking considering the size and CpG
content of the HMRs in the patient sample, as they cover 38.2%
of the genome and harbor 39.3% of all CpGs. In contrast, HMRs
in the healthy control sample represent only 3.1% of the genome
and 9.9% of CpG sites. This is also reflected by the geometric
mean sizes of 1.4 kb (95% CI: 0.2-8.4 kb) vs. 5.0 kb (95% CI:
0.8-30.9 kb) of the control and ICF sample, respectively.
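
A geometric mean with a multiplicative interval can be computed on the log scale. The sketch below assumes that the reported interval corresponds to the mean ± 1.96 standard deviations of the log-transformed HMR sizes, which the text does not state explicitly; the function name `log_scale_summary` and the example sizes are hypothetical.

```python
import math


def log_scale_summary(sizes_bp, z=1.96):
    """Return (geometric mean, lower bound, upper bound) of region sizes.

    Assumption: the interval is mean +/- z * SD computed on log-transformed sizes.
    """
    if len(sizes_bp) < 2:
        raise ValueError("need at least two sizes")
    logs = [math.log(s) for s in sizes_bp]
    mean = sum(logs) / len(logs)
    # Sample variance of the log-transformed sizes.
    var = sum((x - mean) ** 2 for x in logs) / (len(logs) - 1)
    sd = math.sqrt(var)
    return math.exp(mean), math.exp(mean - z * sd), math.exp(mean + z * sd)


# Hypothetical HMR sizes in base pairs, for illustration only.
gm, lower, upper = log_scale_summary([200, 900, 1500, 2500, 8000])
print(round(gm), round(lower), round(upper))
```

An interval of this kind is symmetric around the geometric mean on a multiplicative scale. The reported bounds behave in that way (for example, 1.4/0.2 and 8.4/1.4 both lie between 6 and 7), although this alone does not confirm how they were derived.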

To further characterize the distinct HMR populations in the
healthy and ICF samples, we assessed HMR size, CpG content
and score. The score^^'^^ reflects the number of CpG sites in an HMR
weighted inversely by their methylation status (more
methylation, lower value). The Uscore is a normalized score
(Uscore = score/CpG) that allows comparisons independent
of HMR length. While the healthy cells revealed mainly CpG
dense, highly hypomethylated loci and some CpG poorer regions
with moderate hypomethylation, the ICF patient presented CpG
poor regions with low to intermediate DNA methylation levels
(Fig. 2E and F). Although we observed a global hypomethylation
of the genome, displayed by large blocks of reduced DNA
methylation, a subpopulation of HMRs kept their size and
composition, probably enabling ubiquitous expression by structural
conservation of gene promoters crucial for cell survival. These
conserved regions displayed a high CpG content and a low level
of DNA methylation, features frequently observed at CpG rich
promoters, which were identified above as maintaining their DNA
methylation structure.
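
The score and Uscore described above can be illustrated with a minimal sketch. It assumes that each CpG contributes a weight of 1 minus its methylation level; the exact weighting used for the published scores is not given in this text, and `hmr_scores` is a hypothetical name.

```python
def hmr_scores(methylation_levels):
    """Return (score, uscore) for one HMR.

    methylation_levels: per-CpG methylation levels in the range 0-1.
    Assumption: each CpG is weighted by (1 - methylation level).
    """
    n_cpg = len(methylation_levels)
    if n_cpg == 0:
        raise ValueError("an HMR needs at least one CpG")
    score = sum(1.0 - m for m in methylation_levels)
    uscore = score / n_cpg  # normalization makes HMRs of different length comparable
    return score, uscore


# Two fully unmethylated and two half-methylated CpGs.
print(hmr_scores([0.0, 0.0, 0.5, 0.5]))  # prints (3.0, 0.75)
```

With this weighting, a short CpG dense HMR that is almost unmethylated obtains a Uscore close to 1, whereas a large block of intermediate methylation obtains a lower Uscore despite a higher raw score.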

Differentially methylated regions (DMRs). In order to
identify specific loci differentially methylated between the control
and ICF sample, we screened the methylomes for regions
consistently changing between both samples. DMRs were defined as
regions of at least five consistently differentially methylated CpG
sites between the control and ICF sample. In total, we identified
315,813 DMRs present on all chromosomes. Consistent with
the genome-wide changes reported above, the DMRs represented
almost exclusively hypomethylation (296,964 out of 315,813) in
the ICF patient (Fig. 3A). Interestingly, despite the global loss of
methylation, distinct regions displayed a gain of methylation
in the ICF patient. In particular, we found 18,849 (6.0%) DMRs
representing a gain of methylation in the ICF patient, displaying
a distinct genomic distribution compared with their
hypomethylated counterparts. The majority of hypo-DMRs (57.8%) mapped
to intergenic regions, 37.7% overlapped gene bodies
(intragenic) and 4.5% were located in gene promoters. In contrast, the
majority of hyper-DMRs were associated with genic regions, with
66.6% and 11.8% mapping to gene bodies and promoter regions,
respectively. Analyzing the chromosomal distribution of DMRs
revealed a higher abundance on the X chromosome than on
the autosomes, with 55% and 41% of the chromosomes covered
by DMRs, respectively.

Although genomic hypomethylation appeared throughout the
genome, the loss of methylation at CpG rich gene promoters did
not present a diffuse pattern but appeared rather organized,
ensuring expression of genes crucial for survival. In line with this, we detected
tissue-specific genes that were highly methylated in the healthy control and
revealed a de novo established HMR in their promoters in the ICF patient. Here,
in particular, testis-cancer specific genes of the TUDOR family
(TDRD1 and TDRD9) presented sharply structured de novo
HMRs in their promoters (Fig. 3B), accompanied by re-expression
of the testis-specific genes in B-cells of ICF patients, as previously
detected (reference 13).

In the disease context analyzed here, hypermethylated regions
represent a special event, as they run against the global loss
of DNA methylation and also against expectations, since ICF
patients harbor mutations in DNMT3B that repress its
activity. DNMT3B binding to H3K4 was previously described to be
impaired by DNMT3L upon mono-, di- or tri-methylation of
the H3K4 residue. In the ICF patient, we detected an enrichment
of hypermethylated DMRs in particular at loci marked by methylated
H3K4, suggesting that the residual activity of DNMT3B is
misguided to those regions (Fig. 3C).^^ Here, mislocated
DNMT3B activity might be mediated by an impaired
interaction with DNMT3L, as previously determined for different
DNMT3B mutants in ICF patients. DNMT3L interacts with
unmethylated H3K4 tails, whereas H3K4 methylation blocks
the binding and subsequent DNA methylation of the marked
regions by DNMT3B.^^'^^ Consistently, we detected an
enrichment of hypermethylated DMRs at sites where DNMT3L binding
is normally inhibited, such as methylated H3K4 (Fig. 3C). As
H3K4me1 in particular marks the boundaries flanking the TSS^^
(Fig. 3D; Fig. S2), a mislocated activity of DNMT3B in the
flanking region could be suspected. Indeed, displaying promoter
sites harboring hypermethylated DMRs revealed an increase of
methylation in the sharply dropping boundaries, resulting in
narrower HMRs of CpG rich and poor promoters (Fig. 3E).
In addition, hyper-DMRs revealed a promoter distribution highly
similar to the positioning of H3K4me1 (Fig. 3D). Here,
chromatin immunoprecipitation bisulphite sequencing (ChIP-BS-seq^^)
of H3K4me1 in healthy and ICF samples could clarify whether
these regions gain methylation in the patient.

Analyzing genes harboring hyper-DMRs in their respective
promoter regions by gene ontology (GO) analysis (biological
process, level 5) revealed an enrichment of basic cellular
mechanisms, such as RNA metabolism, regulation of transcription,
cellular biosynthesis, nucleobase, nucleoside, nucleotide and nucleic
acid metabolism, macromolecule biosynthesis and transferase
activity (FDR, p < 0.05).

Therefore, we propose a model in which, for certain
constitutively active housekeeping genes, the promoter structure is
maintained to ensure transcription. However, these genes
present slightly narrowed gene promoter HMRs in ICF patients,
suggesting that the responsible mechanism fails to entirely
restore the original status of the healthy donor (Fig. 3E). We
suggest that mislocated DNMT3B activity hypermethylates
sharp promoter boundaries, which are marked and protected by
H3K4me1 in the original state (Fig. 3D). In wild-type cells,
binding of DNMT3B to methylated H3K4 is impaired by DNMT3L,
resulting in hypomethylated H3K4 marked regions and wider
promoter HMRs.

From a disease point of view, genes gaining methylation are
associated with B-cell maturation, a function impaired in ICF
patients, causing agammaglobulinemia and severe immunological
defects. Although based on different genetic alterations, ICF
patients share phenotypical overlap with patients suffering from
congenital agammaglobulinemia,^^ who also present an early
onset of recurrent infections. This X chromosome-linked
disease is caused by mutation of the Bruton's tyrosine kinase
(BTK), resulting in impaired B-cell maturation and immune
system failure. BTK is involved in the signal transduction
activated by the B-cell receptor (BCR), which involves additional
molecules such as SYK and the B-cell linker (BLNK), a scaffold
protein that binds BTK, PLCγ2, GRB2, VAV1 and NCK1.39,40
Considering similarities in the disease phenotype of
congenital agammaglobulinemia and ICF, it is tempting to speculate
that the BCR pathway is also altered in ICF, although not by
genetic but by epigenetic alterations. Consistently, we found
hypermethylated DMRs in the promoter regions of GRB2, VAV1 and
NCK1, creating smaller HMRs overlapping the TSS (Fig. 3F).
In line with this, we found further genes of the BCR pathway
epigenetically altered. In particular, the BTK activator SYK revealed a
hypermethylated DMR and the BTK repressor SH3BP5 a
hypomethylated DMR in their promoter regions (Fig. 3G). It is
of note that the hypomethylation of the SH3BP5 promoter was
not diffuse, but rather created a sharply formed HMR likely
to favor gene expression through its transcription promoting
structure. Most importantly, the expression of SYK and SH3BP5
was shown to be down- and upregulated in ICF patients,
respectively, consistent with their promoter methylation profiles.
Interestingly, ZBTB24, the second gene genetically altered in
ICF patients (Type 2), harbored a hypermethylated DMR in its
promoter. Although not mutated, inactivating hypermethylation
of ZBTB24 might contribute to the Type 1 disease phenotype, as
apparent in ICF Type 2 patients.

Materials and Methods 

Sample preparation. Normal B cells of a healthy donor (female,
4 y) were immortalized by Epstein-Barr virus (EBV) applying
a previously published protocol (reference 42). The lymphoblastoid cell
line generated from an ICF patient (GM08714, female, 1 y) was
obtained from the Coriell Cell Repositories. DNA was extracted
using phenol:chloroform:isoamyl alcohol (Sigma).

Whole-genome bisulphite sequencing. For normal BS-seq
library construction, 10 μg genomic DNA was fragmented using a
Covaris sonication system (Covaris S2). Following fragmentation,
libraries were constructed using the Illumina Paired-End
protocol consisting of end repair, <A> base addition and
methylated-adaptor ligation. Ligated DNA was bisulfite converted using the
EZ DNA Methylation-Gold kit (ZYMO). Different insert sizes
were excised from the same lane of a 2% TAE agarose gel. Products
were purified using the QIAquick Gel Extraction kit (Qiagen) and
amplified by PCR. PCR was performed in a final reaction volume
of 50 μl consisting of 20 μl purified DNA, 4 μl 2.5 mM dNTP,
5 μl 10x buffer, 0.5 μl JumpStart™ Taq DNA polymerase, 2 μl
10 μM PCR primers and 37.5 μl water, using the following thermal
cycling program: 94°C for 30 sec, 10 cycles of 94°C for 30 sec, 60°C
for 30 sec and 72°C for 30 sec, followed by a final extension of 1 min
at 72°C. Sequencing was performed using the HiSeq 2000 (Illumina).



Data processing. We performed sequence alignment and
methylation calling with version 2.43 of the BSMAP software.^^
All genomic information, unless otherwise expressly stated, was
obtained using CellBase. Handling of SAM/BAM and BED files
was done using SAMtools^^ and bedtools.^^ Statistical analysis
and graphic representation were performed with R
(www.R-project.org) and the libraries multicore and ggplot2. We defined the
promoter region as the 2 kb flanking the transcription start site (TSS).
The TSS was considered to be the upstream-most base of all the
transcripts of the gene. When depicting methylation profiles, only
CpG dinucleotide sites with a coverage of at least 4 were
considered.
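
The coverage filter and the promoter window described above can be sketched as follows. This is an illustration only: the input layout (position, methylated reads, unmethylated reads) and the function names are assumptions, and the methylation level is taken here as methylated reads divided by total reads.

```python
def promoter_window(tss, flank=2000):
    """Return (start, end) of the region spanning `flank` bases on each side of the TSS."""
    return max(0, tss - flank), tss + flank


def cpg_levels(sites, min_coverage=4):
    """Map position -> methylation level for CpGs with sufficient coverage.

    sites: iterable of (position, methylated_reads, unmethylated_reads).
    """
    levels = {}
    for pos, meth, unmeth in sites:
        coverage = meth + unmeth
        if coverage >= min_coverage:
            levels[pos] = meth / coverage
    return levels


def promoter_profile(sites, tss, flank=2000, min_coverage=4):
    """Methylation levels of sufficiently covered CpGs inside the promoter window."""
    start, end = promoter_window(tss, flank)
    levels = cpg_levels(sites, min_coverage)
    return {pos: lvl for pos, lvl in sorted(levels.items()) if start <= pos <= end}


# Hypothetical read counts: the CpG at 4,000 is dropped (coverage 2), 9,000 lies outside.
example = [(3500, 3, 1), (4000, 1, 1), (6000, 0, 4), (9000, 8, 0)]
print(promoter_profile(example, tss=5000))  # prints {3500: 0.75, 6000: 0.0}
```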

Distance correlation. We assessed the correlation of
CpGs in close proximity by gathering information about
methylation and relative distance up to 1,000 bases away from 2,000
randomly selected CpG sites and correlating pairwise
methylation at single CpG sites for each relative distance.
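
The procedure above can be sketched in a few lines. This is not the original implementation: it assumes a single chromosome given as a mapping from CpG position to methylation level, looks only at downstream neighbors, and uses the Pearson coefficient; all names are hypothetical.

```python
import math
import random
from bisect import bisect_right
from collections import defaultdict


def pearson(xs, ys):
    """Pearson correlation coefficient; NaN if either vector is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)


def neighbor_correlation_by_distance(levels, n_anchors=2000, max_dist=1000, seed=0):
    """Correlate methylation of anchor CpGs with CpGs at each downstream distance.

    levels: dict mapping CpG position -> methylation level (0-1).
    Returns {distance: correlation} for distances with at least 3 pairs.
    """
    positions = sorted(levels)
    rng = random.Random(seed)
    anchors = rng.sample(positions, min(n_anchors, len(positions)))
    pairs = defaultdict(lambda: ([], []))
    for anchor in anchors:
        # Walk over CpGs downstream of the anchor until max_dist is exceeded.
        for pos in positions[bisect_right(positions, anchor):]:
            dist = pos - anchor
            if dist > max_dist:
                break
            xs, ys = pairs[dist]
            xs.append(levels[anchor])
            ys.append(levels[pos])
    return {d: pearson(xs, ys) for d, (xs, ys) in sorted(pairs.items()) if len(xs) >= 3}
```

Running the function on the control and the patient methylomes and plotting correlation against distance would give curves of the kind compared in Fig. 2D.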

Hypomethylated regions. The hypomethylated regions
(HMRs) were identified via a hidden Markov model segmentation
algorithm as previously described by Molaro et al.^^

Differentially methylated regions. Differentially methylated
regions (DMRs) were identified by considering all homozygous
CpG dinucleotides and searching for maximal-length sequences
of consecutive CpGs with a consistent direction of methylation
change between the two samples for all CpGs in the region,
where the region contains a minimum of 5 CpGs and its first
and last CpG dinucleotides each show a statistic for differential
methylation of > 3.84, corresponding to a p value of < 0.05 for
both residues.
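
A minimal sketch of this DMR definition is given below. It is an interpretation rather than the original code: the statistic is computed as a Pearson χ² on the 2x2 table of methylated and unmethylated read counts, runs of CpGs changing in the same direction are trimmed to their outermost significant CpGs, and the input layout and function names are assumptions.

```python
def chi2_2x2(m1, u1, m2, u2):
    """Pearson chi-square statistic (1 d.f., no continuity correction) for a 2x2 table."""
    n = m1 + u1 + m2 + u2
    denom = (m1 + u1) * (m2 + u2) * (m1 + m2) * (u1 + u2)
    if denom == 0:
        return 0.0
    return n * (m1 * u2 - m2 * u1) ** 2 / denom


def find_dmrs(cpgs, min_cpgs=5, threshold=3.84):
    """Return DMRs as (start, end, direction, n_cpgs) tuples.

    cpgs: position-sorted tuples (pos, meth_ctrl, unmeth_ctrl, meth_icf, unmeth_icf);
    every CpG is assumed to be covered by at least one read in both samples.
    """
    annotated = []
    for pos, mc, uc, mi, ui in cpgs:
        delta = mi / (mi + ui) - mc / (mc + uc)
        direction = (delta > 0) - (delta < 0)  # +1 gain, -1 loss, 0 unchanged
        significant = chi2_2x2(mc, uc, mi, ui) > threshold
        annotated.append((pos, direction, significant))

    dmrs = []
    i = 0
    while i < len(annotated):
        direction = annotated[i][1]
        j = i
        # Extend the run while the direction of change stays the same.
        while j + 1 < len(annotated) and annotated[j + 1][1] == direction:
            j += 1
        if direction != 0:
            run = annotated[i:j + 1]
            sig = [k for k, (_, _, s) in enumerate(run) if s]
            # Trim to the outermost significant CpGs and apply the size threshold.
            if sig and sig[-1] - sig[0] + 1 >= min_cpgs:
                label = "hyper" if direction > 0 else "hypo"
                dmrs.append((run[sig[0]][0], run[sig[-1]][0], label, sig[-1] - sig[0] + 1))
        i = j + 1
    return dmrs


# Six CpGs losing methylation in the patient, followed by one CpG gaining methylation.
example = [(100 * k, 9, 1, 2, 8) for k in range(1, 7)] + [(700, 2, 8, 9, 1)]
print(find_dmrs(example))  # prints [(100, 600, 'hypo', 6)]
```

The threshold of 3.84 is the 95th percentile of the χ² distribution with one degree of freedom, which is why it corresponds to p < 0.05.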

Conclusion 

Although ICF patients show a massive loss of DNA methylation
throughout all genomic regions and genetic features, some
structures are maintained in their original state, suggesting a
selective pressure to keep those features stable to ensure cell
survival. Despite a global loss of DNA methylation in DNMT3B
mutant cells, the shape of genetic features, such as promoters or
CpG islands, is conserved. Approaching genetic structures by the
analysis of hypomethylated regions supports the idea of structural
addiction to ensure survival, as a subpopulation of small CpG
rich loci with low levels of methylation was conserved in the
ICF patient, against the global trend of wide hypomethylated
regions. Furthermore, DNMT3B mutant cells do not show a
decrease of correlation between neighboring CpG sites, favoring
the idea that, although flattened, the global DNA methylation
landscape is maintained. Unexpectedly, distinct hypermethylated
loci were observed and associated with H3K4 occupied
regions, suggesting a mislocated activity of DNMT3B in ICF
patients due to the lack of control by DNMT3L. As DNMT3L
binding to DNMT3B also highly increases the activity of the
methyl-transferase, the impaired interaction reduces both the
activity and the specificity of the enzyme, with effects visible in the ICF
patient's DNA methylome.

Functionally, ICF patients present hypermethylated genes
involved in B-cell maturation and previously associated with
congenital agammaglobulinemia. Induction of these genes
by DNMT inhibitors, such as 5-Azacytidine, might serve
as a possible treatment strategy for ICF patients mutant
for DNMT3B.